Confidence-Based Robot Policy Learning from Demonstration

Authors

  • Sonia Chernova
  • Christopher Atkeson
  • Avrim Blum
  • Cynthia Breazeal
Abstract

The problem of learning a policy, a task representation mapping from world states to actions, lies at the heart of many robotic applications. One approach to acquiring a task policy is learning from demonstration, an interactive technique in which a robot learns a policy based on example state-to-action mappings provided by a human teacher. This thesis introduces Confidence-Based Autonomy, a mixed-initiative single-robot demonstration learning algorithm that enables the robot and teacher to jointly control the learning process and the selection of demonstration training data. The robot identifies the need for, and requests, demonstrations for specific parts of the state space based on confidence thresholds characterizing the uncertainty of the learned policy. The robot’s demonstration requests are complemented by the teacher’s ability to provide supplementary corrective demonstrations in error cases. An additional algorithmic component enables choices between multiple equally applicable actions to be represented explicitly within the robot’s policy through the creation of option classes. Based on the single-robot Confidence-Based Autonomy algorithm, this thesis introduces a task- and platform-independent multi-robot demonstration learning framework for teaching multiple robots. Building upon this framework, we formalize three approaches to teaching emergent collaborative behavior based on different information-sharing strategies. We provide detailed evaluations of all algorithms in multiple simulated and robotic domains, and present a case study analysis of the scalability of the presented techniques using up to seven robots.
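The confidence-threshold idea described in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: the nearest-neighbour classifier, the vote-fraction confidence measure, the distance threshold, and every name below (`ConfidencePolicy`, `needs_demonstration`, `step`, the `teacher` callback) are assumptions made purely for illustration.

```python
from collections import Counter
from math import dist


class ConfidencePolicy:
    """Toy k-nearest-neighbour policy (an assumed stand-in classifier)."""

    def __init__(self, k=3, confidence_threshold=0.7, distance_threshold=1.0):
        self.k = k
        self.confidence_threshold = confidence_threshold  # hypothetical value
        self.distance_threshold = distance_threshold      # hypothetical value
        self.examples = []  # stored (state, action) demonstrations

    def add_demonstration(self, state, action):
        """Store a requested or corrective demonstration from the teacher."""
        self.examples.append((tuple(state), action))

    def predict(self, state):
        """Return (action, confidence, distance to the nearest example)."""
        state = tuple(state)
        if not self.examples:
            return None, 0.0, float("inf")
        ranked = sorted(self.examples, key=lambda ex: dist(ex[0], state))
        neighbours = ranked[: self.k]
        votes = Counter(action for _, action in neighbours)
        action, count = votes.most_common(1)[0]
        # Confidence here is simply the fraction of neighbours that agree.
        return action, count / len(neighbours), dist(neighbours[0][0], state)

    def needs_demonstration(self, state):
        """True when the policy is too uncertain to act on its own."""
        _, confidence, nearest = self.predict(state)
        return (confidence < self.confidence_threshold
                or nearest > self.distance_threshold)


def step(policy, state, teacher):
    """Act autonomously when confident, otherwise ask the teacher.

    Returns (action, requested), where requested is True if a
    demonstration was requested for this state.
    """
    if policy.needs_demonstration(state):
        action = teacher(state)
        policy.add_demonstration(state, action)
        return action, True
    action, _, _ = policy.predict(state)
    return action, False
```

In this sketch the teacher's corrective demonstrations would go through the same `add_demonstration` call, issued after the teacher observes an incorrect autonomous action rather than in response to a request.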

Similar resources

Scalability of Confidence-Based Autonomy Multi-Robot Demonstration Learning

In this paper, we present the first application of demonstration learning to more than two robots and perform an analysis of the scalability of the Confidence-Based Autonomy (CBA) multi-robot demonstration learning algorithm. Through experimental evaluation using up to seven Sony AIBO robots, we examine how the number of robots being taught by a human teacher at the same time affects the number...

Flexible Demonstration Learning System for Variable Number of Robots

In this paper, we present flexMLfD, a robot independent and task independent demonstration learning system that supports a variable number of robot learners. Our approach is based on the Confidence-Based Autonomy (CBA) demonstration learning algorithm, which provides the means for a single robot to learn a task policy through interaction with a human teacher. The generalized representation and ...

Confidence-Based Multi-Robot Learning from Demonstration

Learning from demonstration algorithms enable a robot to learn a new policy based on demonstrations provided by a teacher. In this article, we explore a novel research direction, multi-robot learning from demonstration, which extends demonstration based learning methods to collaborative multi-robot domains. Specifically, we study the problem of enabling a single person to teach individual polic...

RGame: Embodied Gaming for Robot Learning by Demonstration

We here demonstrate robot learning from demonstration using interactive tutelage. Our Dogged Learning architecture (introduced in (Grollman and Jenkins 2007a) and shown in Figure 1) combines concepts of teleoperative demonstration, mixed-initiative control, feedback and transparency, and real time policy inference. It is designed to be abstract, applicable to many platforms (robots), demonstrat...

Confidence-Based Demonstration Selection for Interactive Robot Learning

Effective learning by demonstration techniques enable complex robot behaviors to be taught from a small number of demonstrations. Demonstrations are obtained based on a selection algorithm, which governs which states are labeled by the human teacher. In this work, we examine selection algorithms used by the robot to request demonstration examples. Previous approaches typically rely on a fixed c...

Publication date: 2009